Ccc-bicluster Analysis for Time Series Gene Expression Data

نویسندگان

  • D. Soundaravalli
  • S. Thilagavathi
چکیده

Many of the biclustering problems have been shown to be NP-complete. However, when they are interested in identify biclusters in time series expression data, it can limit the problem by finding only maximal biclusters with contiguous columns. This restriction leads to a well-mannered problem. Its motivation is the fact that biological processes start and conclude in an identifiable contiguous period of time, leading to increased (or decreased) activity of sets of genes forming biclusters with adjacent columns. In this context, propose an algorithm that finds and reports all maximal adjacent column coherent biclusters (CCC-Biclusters), in time linear in the size of the expression matrix. Each relevant CCC-Bicluster identified corresponds to the detection of a coherent expression pattern shared by a group of genes in a adjacent subset of time-points and identifies a potentially relevant regulatory module. The linear time complexity of CCC-Biclustering is acquired by manipulating a discretized version of the gene expression matrix and using efficient string dealing out techniques based on suffix trees. The effectiveness was obtained by applying the algorithm to the transcriptomic expression patterns stirring in Saccharomyces cerevisiae in response to heat stress. Key Words— Biclustering, gene expression data,

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Finding Correlated CCC-Biclusters from Gene Expression Data

Several non-supervised machine learning methods have been used in the analysis of gene expression data obtained from microarray experiments. Recently, biclustering, a non-supervised approach that performs simultaneous clustering on the row and column dimensions of the data matrix, has been shown to be remarkably effective in a variety of applications. The goal of biclustering is to find subgrou...

متن کامل

e-CCC-Biclustering: Related work on biclustering algorithms for time series gene expression data

This document provides supplementary material describing related work on biclustering algorithms for time series gene expression data analysis. We describe in detail three state of the art biclustering approaches specifically design to discover biclusters in gene expression time series and identify their strengths and weaknesses.

متن کامل

Efficient Biclustering Algorithms for Identifying Transcriptional Regulation Relationships Using Time Series Gene Expression Data

Biclustering algorithms have shown to be remarkably effective in a variety of applications. Although the biclustering problem is known to be NP-complete, in the particular case of time series gene expression data analysis, efficient and complete biclustering algorithms, are known and have been used to identify biologically relevant expression patterns. However, these algorithms, namely CCC-Bicl...

متن کامل

An Efficient Biclustering Algorithm for Finding Genes with Similar Patterns in Time-series Expression Data

Biclustering algorithms have emerged as an important tool for the discovery of local patterns in gene expression data. For the case where the expression data corresponds to time-series, efficient algorithms that work with a discretized version of the expression matrix are known. However, these algorithms assume that the biclusters to be found are perfect, in the sense that each gene in the bicl...

متن کامل

Recent patents on biclustering algorithms for gene expression data analysis.

In DNA microarray experiments, discovering groups of genes that share similar transcriptional characteristics is instrumental in functional annotation, tissue classification and motif identification. However, in many situations a subset of genes only exhibits a consistent pattern over a subset of conditions. Although used extensively in gene expression data analysis, conventional clustering alg...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016